VPsearch: fast exact sequence similarity search for genomic sequences
Authors
Abstract
Similar resources
A fast algorithm for exact sequence search in biological sequences using polyphase decomposition
MOTIVATION: Exact sequence search allows a user to search for a specific DNA subsequence in a larger DNA sequence or database. It serves as a vital block in many areas such as Pharmacogenetics, Phylogenetics and Personal Genomics. As sequencing of genomic data becomes increasingly affordable, the amount of sequence data that must be processed will also increase exponentially. In this context, fa...
SW#db: GPU-Accelerated Exact Sequence Similarity Database Search
In recent years we have witnessed a growth in sequencing yield, in the number of samples sequenced, and, as a result, in the size of publicly maintained sequence databases. This increase in data has placed high demands on protein similarity search algorithms, which face two opposing goals: how to keep the running times acceptable while maintaining a high-enough level of sensitivity....
Querying Timestamped Event Sequences by Exact Search or Similarity-based Search: Design and Empirical Evaluation
Specifying timestamped event sequence queries is challenging even for skilled computer professionals familiar with SQL. Most graphical user interfaces for database search use an exact search approach, which is often effective, but applies an exact match criterion. We describe a new similarity-based search interface, in which users specify a query by simply placing events on a blank timeline and r...
Similarity Search In Sequence
We propose an indexing method for time sequences for processing similarity queries. We use the Discrete Fourier Transform (DFT) to map time sequences to the frequency domain, the crucial observation being that, for most sequences of practical interest, only the first few frequencies are strong. Another important observation is Parseval's theorem, which specifies that the Fourier transform preserve...
Similarity Search for Multidimensional Data Sequences
Time-series data, which are a series of one-dimensional real numbers, have been studied in various database applications. In this paper, we extend the traditional similarity search methods on time-series data to support a multidimensional data sequence, such as a video stream. We investigate the problem of retrieving similar multidimensional data sequences from a large database. To prune irrele...
Journal
Journal title: Journal of Open Source Software
Year: 2022
ISSN: 2475-9066
DOI: https://doi.org/10.21105/joss.04236